Abstract. For words of length $n$, generated by independent geometric random variables,
we consider the average value and the average position of the $r$th left-to-right
maximum, for fixed $r$ and $n\to\infty$.



1. Introduction

For a permutation $\sigma_1\sigma_2\ldots\sigma_n$, a left-to-right maximum (outstanding element, record,
...) is an element $\sigma_j$ with $\sigma_j>\sigma_i$ for all $i=1,\ldots,j-1$. The number of left-to-right
maxima was first studied by Rényi [?], compare also [?]. A survey of results on this
topic can be found in [?].

Recently Wilf in [10] proved the formula $(1-2^{-r})n$ for the average value of the $r$th
left-to-right maximum, for fixed $r$ and $n\to\infty$; for the average position he obtained
the asymptotic formula $(\log n)^{r-1}/(r-1)!$.

In [?] the number of left-to-right maxima was investigated in the model of words
(strings) $a_1\ldots a_n$, where the letters $a_i\in\mathbb{N}$ are independently generated according
to the geometric distribution with $\mathbb{P}\{X=k\}=pq^{k-1}$, with $p+q=1$. (We find it
useful also to use the abbreviation $Q=q^{-1}$.) The motivation for this work came from
Computer Science. Also, since equal letters are now allowed, there are two versions that
should be considered in parallel, the standard version, and the weak version, where '<'
is replaced by '≤', which means that a new maximum only has to be larger than or equal
to the previous ones. The paper [?] contains asymptotic results about the average
and the variance of the number of left-to-right maxima in the context of geometric
random variables. (H.-K. Hwang and his collaborators obtained further results about
the limiting behaviour in [?].)

Motivated by Wilf's study we consider here the two parameters 'value' and 'position'
of the $r$th left-to-right maximum for geometric random variables. Summarizing
our results, we obtain the asymptotic formulae $\frac{r}{p}$ and
$\frac{1}{(r-1)!}\bigl(\frac{p}{q}\log_Q n\bigr)^{r-1}$, resp. $\frac{rq+p}{p}$ and
$\frac{1}{(r-1)!}\bigl(p\log_Q n\bigr)^{r-1}$ in the weak case.

A certain knowledge of [?] might be beneficial to understanding the present derivations.

It should be noted that not all random strings of length n have r left-to-right maxima. 
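The two notions of left-to-right maximum can be made concrete with a small sketch. It is our own illustration and not part of the original text; the names `ltr_maxima` and `sample_word` are hypothetical, and letters are sampled by inversion under the assumption $0<p<1$.

```python
import math
import random

def ltr_maxima(word, weak=False):
    """Return (position, value) pairs of the left-to-right maxima of `word`.

    Positions are 1-based. In the weak version a letter equal to the
    current maximum also counts as a new maximum.
    """
    maxima = []
    best = 0  # letters are positive integers, so 0 lies below every letter
    for pos, x in enumerate(word, start=1):
        if x > best or (weak and x == best):
            maxima.append((pos, x))
            best = x
    return maxima

def sample_word(n, p, rng):
    """Draw n independent letters with P{X = k} = p * q**(k-1), q = 1 - p."""
    log_q = math.log(1.0 - p)
    # inversion: P{X > m} = q**m, and 1 - rng.random() lies in (0, 1]
    return [1 + int(math.log(1.0 - rng.random()) / log_q) for _ in range(n)]

word = [2, 1, 2, 5, 5, 3]
print(ltr_maxima(word))             # [(1, 2), (4, 5)]
print(ltr_maxima(word, weak=True))  # [(1, 2), (3, 2), (4, 5), (5, 5)]
```

A word such as 3 1 2 has a single strict maximum, so its second maximum does not exist; this is the reason for the normalization discussed next.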
Let us start with the value. The generating function of interest is
\[
\frac{1}{1-z}\prod_{i=1}^{h-1}\biggl(1+\frac{pq^{i-1}zu}{1-(1-q^{i})z}\biggr)\,pq^{h-1}z,
\]
which originates from the (unique) decomposition of a string as $a_1w_1\ldots a_{r-1}w_{r-1}a_rw$,
where $a_1,\ldots,a_r$ are the left-to-right maxima, the $w_i$ are the strings between them,
and $w$ can be anything. Note that if $a_k=l$, then this corresponds to a term $pq^{l-1}zu$,
and thus $w_k$ corresponds to $1/(1-(1-q^{l})z)$. A value $i$ need not occur
as a left-to-right maximum; that is reflected by the $1+\ldots$ in the product. However,
when we look for the coefficient of $u^{r-1}$, we have seen $r-1$ left-to-right maxima, and
the $r$th has value $h$. What comes after that is irrelevant and covered by the factor
$1/(1-z)$. (Compare [?] for similar generating functions.)

In the sequel we find it useful to use the abbreviation $[i] := 1-(1-q^{i})z$.
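The decomposition can be tested on small cases. The sketch below is our own check, not taken from the paper, and all function names are hypothetical. It computes the probability that a word of length $n$ has an $r$th strict maximum equal to $h$ in two ways: by enumerating all prefixes up to that maximum, and by expanding the coefficient of $z^nu^{r-1}$ over chains $i_1<\dots<i_{r-1}<h$. Exact rational arithmetic is used so that the two results can be compared with `==`.

```python
from fractions import Fraction
from itertools import combinations, product

def count_strict_maxima(word):
    count, best = 0, 0
    for x in word:
        if x > best:
            count, best = count + 1, x
    return count

def by_enumeration(n, r, h, p):
    """P(a word of length n has an r-th strict maximum and it equals h)."""
    q = 1 - p
    total = Fraction(0)
    for j in range(r, n + 1):                 # position of the r-th maximum
        # letters before position j are < h; letters after it are arbitrary
        for prefix in product(range(1, h), repeat=j - 1):
            if count_strict_maxima(prefix) == r - 1:
                prob = p * q ** (h - 1)
                for x in prefix:
                    prob *= p * q ** (x - 1)
                total += prob
    return total

def complete_homogeneous(m, xs):
    """[z^m] of the product of 1/(1 - x z) over x in xs."""
    coeffs = [Fraction(1)] + [Fraction(0)] * m
    for x in xs:
        for d in range(1, m + 1):
            coeffs[d] += x * coeffs[d - 1]
    return coeffs[m]

def by_decomposition(n, r, h, p):
    """[z^n u^(r-1)] of the generating function, expanded over the chains."""
    q = 1 - p
    if n < r:
        return Fraction(0)
    total = Fraction(0)
    for chain in combinations(range(1, h), r - 1):
        weight = p * q ** (h - 1)
        for i in chain:
            weight *= p * q ** (i - 1)
        # the r maxima use z^r; what remains is 1/((1-z) [i_1] ... [i_(r-1)])
        poles = [Fraction(1)] + [1 - q ** i for i in chain]
        total += weight * complete_homogeneous(n - r, poles)
    return total

p = Fraction(1, 3)
print(by_enumeration(4, 2, 3, p), by_decomposition(4, 2, 3, p))
```

The enumeration stops at the $r$th maximum because the letters after it are unrestricted, which mirrors the factor $1/(1-z)$.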

The coefficients of $z^nu^{r-1}$, call them $\pi_{n,h}^{(r)}$, are not probabilities, but
$\pi_{n,h}^{(r)}/\pi_n^{(r)}$ are, where $\pi_n^{(r)}$ is the probability that a string of
length $n$ has $r$ left-to-right maxima; we find it as
\[
\pi_n^{(r)}=\sum_{h\ge1}\pi_{n,h}^{(r)}.
\]

Now we turn to the position. Set
\[
\sum_{n,r,j}\sigma_{n,j}^{(r)}z^nu^{r-1}v^j
:=\frac{1}{1-z}\sum_{h\ge1}\prod_{k=1}^{h-1}\biggl(1+\frac{pq^{k-1}zvu}{1-(1-q^{k})zv}\biggr)\,pq^{h-1}zv;
\]
then $\sigma_{n,j}^{(r)}/\pi_n^{(r)}$ is the probability that a random string of length $n$ has the $r$th maximum
in position $j$. It is the same decomposition as before; however, we are not interested
in the value $h$, so we sum over it. On the other hand, we label the position with
the variable $v$, so we must make sure that every $z$ that does not appear in the factor
$1/(1-z)$ is multiplied by a $v$. Computationally, we find it easier to work with
the parameter "position $-\,r$," for which we have to consider
\[
\frac{1}{1-z}\sum_{h\ge1}\prod_{k=1}^{h-1}\biggl(1+\frac{pq^{k-1}zu}{1-(1-q^{k})zv}\biggr)\,pq^{h-1}z,
\]
since the variable $v$ appears in fewer places, as we don't have to multiply all those $z$'s
by $v$ which count for the $r$ left-to-right maxima.

2. Some technical lemmas 

In order to read off coefficients, we state the obvious but nevertheless very useful
formula
\[
[w^n]\sum_i a_if(b_iw)=\sum_i a_ib_i^{\,n}\cdot[w^n]f(w).
\]
In all our applications, $\sum_i a_ib_i^{\,n}$ can be summed in closed form.
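Lemma 1. Assume that we have power series
\[
A^{(j)}(w)=\sum_{n\ge1}a_n^{(j)}w^n,\qquad j=1,\ldots,s.
\]
Then
\[
[w^n]\sum_{1\le i_1<i_2<\dots<i_s}A^{(1)}(wq^{i_1})\cdots A^{(s)}(wq^{i_s})
=\sum_{0=l_0<l_1<\dots<l_{s-1}<l_s=n}
\frac{a^{(s)}_{l_1-l_0}\cdots a^{(1)}_{l_s-l_{s-1}}}{(Q^{l_1}-1)\cdots(Q^{l_s}-1)}.
\]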



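Proof. For the sake of clarity, we treat the case $s=3$ and leave it to the imagination
of the reader to figure out the general case; writing $a_n$, $b_n$, $c_n$ for the
coefficients of $A$, $B$, $C$,
\[
\begin{aligned}
[w^n]\sum_{1\le i<j<h}A(wq^i)B(wq^j)C(wq^h)
&=\sum_l\sum_{1\le i<j<h}[w^{n-l}]A(wq^i)B(wq^j)\cdot[w^l]C(wq^h)\\
&=\sum_l\frac{1}{Q^l-1}\sum_{1\le i<j}[w^{n-l}]A(wq^i)B(wq^j)\cdot[w^l]C(wq^j)\\
&=\sum_l\frac{c_l}{Q^l-1}\sum_{1\le i<j}[w^n]A(wq^i)B(wq^j)(wq^j)^l\\
&=\sum_l\frac{c_l}{Q^l-1}\sum_m\sum_{1\le i<j}[w^{n-m}]A(wq^i)\cdot[w^m]B(wq^j)(wq^j)^l\\
&=\sum_{l,m}\frac{c_l\,b_{m-l}}{(Q^l-1)(Q^m-1)}\sum_{1\le i}[w^n]A(wq^i)(wq^i)^m\\
&=\sum_{0<l<m<n}\frac{c_l\,b_{m-l}\,a_{n-m}}{(Q^l-1)(Q^m-1)(Q^n-1)}.
\end{aligned}
\]
□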
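Lemma 1 can be checked numerically. In the following sketch (ours, with hypothetical names and arbitrarily chosen coefficients) each series $A^{(j)}$ is a dictionary mapping exponents to coefficients. The left-hand side is evaluated by brute force with every index $i_j$ truncated at `cutoff`, which is an approximation; the right-hand side is the closed form of the lemma.

```python
from itertools import combinations

def lhs(coeffs, n, q, cutoff=40):
    """[w^n] of the nested sum, with the indices i_1 < ... < i_s cut at `cutoff`."""
    s = len(coeffs)
    total = 0.0
    for cuts in combinations(range(1, n), s - 1):
        bounds = (0,) + cuts + (n,)
        parts = [bounds[j + 1] - bounds[j] for j in range(s)]  # n = m_1 + ... + m_s
        weight = 1.0
        for a, m in zip(coeffs, parts):
            weight *= a.get(m, 0.0)        # coefficient of w^(m_j) in A^(j)
        if weight == 0.0:
            continue
        for idx in combinations(range(1, cutoff + 1), s):
            total += weight * q ** sum(i * m for i, m in zip(idx, parts))
    return total

def rhs(coeffs, n, q):
    """Closed form of Lemma 1."""
    Q = 1.0 / q
    s = len(coeffs)
    total = 0.0
    for cuts in combinations(range(1, n), s - 1):
        ls = (0,) + cuts + (n,)            # 0 = l_0 < l_1 < ... < l_s = n
        term = 1.0
        for k in range(1, s + 1):
            # the k-th gap l_k - l_(k-1) indexes a coefficient of A^(s+1-k)
            term *= coeffs[s - k].get(ls[k] - ls[k - 1], 0.0) / (Q ** ls[k] - 1.0)
        total += term
    return total

A = [{1: 1.0, 2: 0.5, 3: 2.0}, {1: -1.0, 2: 3.0}, {1: 2.0, 2: 1.0, 4: 0.5}]
print(lhs(A, 6, 0.5), rhs(A, 6, 0.5))
```

Note the reversed order of the superscripts on the right-hand side: the first gap $l_1-l_0$ belongs to the last series $A^{(s)}$, because the innermost summation is the one over $i_s$.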



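Lemma 2.
\[
[w^n]\sum_{1\le i_1<i_2<\dots<i_s}A^{(1)}(wq^{i_1})\cdots A^{(s)}(wq^{i_s})\,i_s
=[t]\sum_{0=l_0<l_1<\dots<l_{s-1}<l_s=n}a^{(s)}_{l_1-l_0}\cdots a^{(1)}_{l_s-l_{s-1}}
\prod_{i=1}^{s}\biggl(\frac{1}{Q^{l_i}-1}+t\,\frac{Q^{l_i}}{(Q^{l_i}-1)^2}\biggr).
\]

Proof. The proof is essentially the same as before, if we note that
\[
\sum_{h>j}h\,q^{hl}=q^{jl}\biggl(\frac{j}{Q^{l}-1}+\frac{Q^{l}}{(Q^{l}-1)^2}\biggr).
\]
□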
Our quantities will eventually come out as alternating sums, and the appropriate
treatment of them is Rice's method which is surveyed in [?]; the key point is the following
Lemma.

Lemma 3. Let $C$ be a curve surrounding the points $1,2,\ldots,n$ in the complex plane
and let $f(z)$ be analytic inside $C$. Then
\[
\sum_{k=1}^{n}\binom{n}{k}(-1)^kf(k)=-\frac{1}{2\pi i}\int_C[n;z]f(z)\,dz,
\]
where
\[
[n;z]=\frac{(-1)^{n-1}n!}{z(z-1)\cdots(z-n)}=\frac{\Gamma(n+1)\Gamma(-z)}{\Gamma(n+1-z)}.
\]

Extending the contour of integration it turns out that under suitable growth conditions
on $f(z)$ (compare [?]) the asymptotic expansion of the alternating sum is given by
\[
\sum\operatorname{Res}\bigl([n;z]f(z)\bigr)+\text{smaller order terms}
\]
where the sum is taken over all poles $z$ different from $1,\ldots,n$. Poles that lie more to
the left lead to smaller terms in the asymptotic expansion.

The range $1,\ldots,n$ for the summation is not sacred; if we sum, for example, over
$k=2,\ldots,n$, the contour must encircle $2,\ldots,n$, etc.
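As a sanity check of Lemma 3, one can take a test function for which the residue sum is exact. The function $f(z)=1/(z+a)$ below is our own choice and does not occur in the paper. It decays at infinity, so the alternating sum equals the sum of the residues of $[n;z]f(z)$ at the two poles outside $1,\ldots,n$, namely $z=0$ and $z=-a$. The sketch also compares the two forms of the kernel at a non-integer point.

```python
import math

def rice_kernel(n, z):
    """[n; z] = (-1)^(n-1) n! / (z (z-1) ... (z-n))."""
    denom = 1.0
    for j in range(n + 1):
        denom *= z - j
    return (-1) ** (n - 1) * math.factorial(n) / denom

def alternating_sum(n, f):
    return sum(math.comb(n, k) * (-1) ** k * f(k) for k in range(1, n + 1))

n, a = 12, 0.5
f = lambda z: 1.0 / (z + a)

res_at_zero = -f(0)                   # the kernel has residue -1 at z = 0
res_at_minus_a = rice_kernel(n, -a)   # f has residue 1 at z = -a
print(alternating_sum(n, f), res_at_zero + res_at_minus_a)

# the Gamma form of the kernel, evaluated at the non-integer point z = -a
z = -a
gamma_form = math.gamma(n + 1) * math.gamma(-z) / math.gamma(n + 1 - z)
print(rice_kernel(n, z), gamma_form)
```

In the applications below the function $f$ has infinitely many poles, and the residue sum is then only an asymptotic expansion rather than an identity.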

3. The probability that there are r maxima 

Now we want to read off the $n$th coefficients of the power series of interest. For this, it
is beneficial to use the following formula:
\[
[z^n]f(z)=(-1)^n[w^n](1-w)^{n-1}f\Bigl(\frac{w}{w-1}\Bigr).
\]
This form can be found in [?] and is based on ideas concerning the Euler transform
in [?]. Then the quantities come out automatically as alternating sums, and Rice's
method can be applied.
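The coefficient formula can be verified term by term. Under the substitution $z=w/(w-1)$ a monomial $z^m$ becomes $(-1)^mw^m(1-w)^{-m}$, so that only $m=n$ contributes to $[w^n](1-w)^{n-1}f\bigl(\frac{w}{w-1}\bigr)$. The sketch below is our illustration with hypothetical names; it assumes that $f$ is given by finitely many Taylor coefficients and carries out the computation in exact arithmetic.

```python
from fractions import Fraction
from math import comb

def coeff_power_of_one_minus_w(e, j):
    """[w^j] (1 - w)^e for an integer e (possibly negative) and j >= 0."""
    if e >= 0:
        return (-1) ** j * comb(e, j)
    return comb(j - e - 1, j)     # (1-w)^(-d) = sum_j C(d+j-1, j) w^j

def coefficient_via_w(coeffs, n):
    """(-1)^n [w^n] (1-w)^(n-1) f(w/(w-1)) for f(z) = sum_m coeffs[m] z^m."""
    total = Fraction(0)
    for m in range(min(n, len(coeffs) - 1) + 1):
        # z^m contributes (-1)^m w^m (1-w)^(n-1-m)
        total += coeffs[m] * (-1) ** m * coeff_power_of_one_minus_w(n - 1 - m, n - m)
    return (-1) ** n * total

f_coeffs = [Fraction(k * k + 1, k + 2) for k in range(8)]
print([coefficient_via_w(f_coeffs, n) == f_coeffs[n] for n in range(8)])
```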



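\[
\begin{aligned}
\pi_n^{(r)}
&=[z^n]\frac{1}{1-z}\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{pq^{i_1-1}z}{[i_1]}\cdots\frac{pq^{i_{r-1}-1}z}{[i_{r-1}]}\,zpq^{h-1}\\
&=\Bigl(\frac pq\Bigr)^{r}[z^n]\frac{1}{1-z}\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}z}{[i_1]}\cdots\frac{q^{i_{r-1}}z}{[i_{r-1}]}\,zq^{h}\\
&=(-1)^n[w^n](1-w)^{n-1}\,(-1)^r\Bigl(\frac pq\Bigr)^{r}\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}\\
&=(-1)^r\Bigl(\frac pq\Bigr)^{r}\sum_{k=r}^{n}\binom{n-1}{k-1}(-1)^k[w^k]
\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}.
\end{aligned}
\]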
Now the evaluation of the inner sum can be done by our Lemma 1; $k=n$, $r=s$,
$A^{(1)}(w)=\dots=A^{(r-1)}(w)=\frac{w}{1-w}$, $A^{(r)}(w)=w$. Therefore
\[
[w^k]\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}
=\sum_{1=l_1<l_2<\dots<l_r=k}\frac{1}{(Q^{l_1}-1)(Q^{l_2}-1)\cdots(Q^{l_r}-1)}.
\]
Thus
\[
\pi_n^{(r)}=(-1)^{r-1}\Bigl(\frac pq\Bigr)^{r}\sum_{k=r-1}^{n-1}\binom{n-1}{k}(-1)^kf(k)
\]
with
\[
f(k)=\sum_{1=l_1<l_2<\dots<l_r=k+1}\frac{1}{(Q^{l_1}-1)(Q^{l_2}-1)\cdots(Q^{l_r}-1)}.
\]

In order to apply Rice's method one needs the continuation of $f(k)$ to the complex
plane. Using symmetric functions, one can always represent such iterated summations
by power sums [?]
\[
\sum_{l=2}^{k}\frac{1}{(Q^{l}-1)^{d}},
\]
and the task is reduced to continuing these quantities to the complex plane. For this,
the standard way of doing it is via
\[
\sum_{l\ge2}\frac{1}{(Q^{l}-1)^{d}}-\sum_{l\ge1}\frac{1}{(Q^{l+z}-1)^{d}}.
\]
However, we only need the values $f(0),\ldots,f(r-2)$.

In [?] we learn how such a sum has to be interpreted; we thus find
$f(1)=\dots=f(r-2)=0$ and
\[
f(0)=\frac{(-1)^{r}}{(Q-1)^{r}}.
\]
Hence Rice's method and the pole at $z=0$ give us
\[
\pi_n^{(r)}=1+O\Bigl(\frac1n\Bigr),
\]
which is intuitively clear.

Note that there are poles at $z=-1+2\pi ik/\log Q$, $k\in\mathbb{Z}$, and they lead to a periodic
fluctuation of order $\frac1n$; this phenomenon is well-known and appears in many places
(compare [?] and some other references).
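The alternating-sum representation of $\pi_n^{(r)}$ can be compared with a direct computation. The sketch below is ours, with hypothetical names. It evaluates the sum with $f(k)=\sum_{1=l_1<\dots<l_r=k+1}\prod_i(Q^{l_i}-1)^{-1}$ in exact arithmetic, and independently scans the word from left to right, tracking the number of maxima and the current maximum. The scan ignores letters larger than `alphabet`, which is an approximation with negligible error for the parameters used.

```python
from fractions import Fraction
from itertools import combinations
from math import comb

def f(k, r, Q):
    """Sum over 1 = l_1 < ... < l_r = k+1 of 1 / prod (Q^l_i - 1)."""
    if k < r - 1:
        return Fraction(0)
    if r == 1:
        return 1 / (Q - 1) if k == 0 else Fraction(0)
    total = Fraction(0)
    for inner in combinations(range(2, k + 1), r - 2):
        term = 1 / ((Q - 1) * (Q ** (k + 1) - 1))
        for l in inner:
            term /= Q ** l - 1
        total += term
    return total

def pi_alternating(n, r, p):
    """pi_n^(r) from the alternating sum; p must be a Fraction."""
    q = 1 - p
    Q = 1 / q
    s = sum(comb(n - 1, k) * (-1) ** k * f(k, r, Q) for k in range(r - 1, n))
    return (-1) ** (r - 1) * (p / q) ** r * s

def pi_scan(n, r, p, alphabet=200):
    """Same probability by a left-to-right scan over truncated letters."""
    p = float(p)
    q = 1.0 - p
    letter = [0.0] + [p * q ** (k - 1) for k in range(1, alphabet + 1)]
    # state[c][m]: probability of c maxima so far (c < r) and current maximum m
    state = [[0.0] * (alphabet + 1) for _ in range(r)]
    state[0][0] = 1.0
    done = 0.0
    for _ in range(n):
        new = [[0.0] * (alphabet + 1) for _ in range(r)]
        for c in range(r):
            below = 0.0                       # mass of states (c, m) with m < k
            for k in range(alphabet + 1):
                new[c][k] += state[c][k] * (1.0 - q ** k)   # next letter <= k
                gain = letter[k] * below      # next letter k is a new maximum
                if c + 1 == r:
                    done += gain
                else:
                    new[c + 1][k] += gain
                below += state[c][k]
        state = new
    return done

p = Fraction(1, 3)
for n in (5, 10, 20, 40):
    print(n, float(1 - pi_alternating(n, 2, p)))   # probability of no 2nd maximum
```

For $n=r=3$ the sum has the single term $k=2$ and reduces to $(Q-1)^2/((Q^2-1)(Q^3-1))$, the probability of a strictly increasing word of length 3.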
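4. The average value of the rth maximum

Now we can safely deal with the quantities $\pi_{n,h}^{(r)}$ alone, and the average
value computed in this way will be correct up to a factor of the form $1+O(\frac1n)$. We compute
\[
\begin{aligned}
\sum_{h\ge1}h\,\pi_{n,h}^{(r)}
&=(-1)^r\Bigl(\frac pq\Bigr)^{r}(-1)^n[w^n](1-w)^{n-1}\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}\,h\\
&=(-1)^r\Bigl(\frac pq\Bigr)^{r}\sum_{k=r}^{n}\binom{n-1}{k-1}(-1)^k[w^k]
\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}\,h.
\end{aligned}
\]
The evaluation of the inner sum is now done by the (second) Lemma:
\[
[w^k]\sum_{1\le i_1<\dots<i_{r-1}<h}
\frac{q^{i_1}w\cdots q^{i_{r-1}}w}{(1-q^{i_1}w)\cdots(1-q^{i_{r-1}}w)}\,wq^{h}\,h
=[t]\sum_{1=l_1<l_2<\dots<l_r=k}\prod_{i=1}^{r}
\biggl(\frac{1}{Q^{l_i}-1}+t\,\frac{Q^{l_i}}{(Q^{l_i}-1)^2}\biggr).
\]
Or,
\[
\sum_{h\ge1}h\,\pi_{n,h}^{(r)}
=(-1)^{r-1}\Bigl(\frac pq\Bigr)^{r}\sum_{k=r-1}^{n-1}\binom{n-1}{k}(-1)^kf(k)
\]
with
\[
f(k)=[t]\sum_{1=l_1<l_2<\dots<l_r=k+1}\prod_{i=1}^{r}
\biggl(\frac{1}{Q^{l_i}-1}+t\,\frac{Q^{l_i}}{(Q^{l_i}-1)^2}\biggr).
\]
Again, $f(1)=\dots=f(r-2)=0$ and
\[
f(0)=[t]\sum_{1=l_1<l_2<\dots<l_r=1}\prod_{i=1}^{r}
\biggl(\frac{1}{Q^{l_i}-1}+t\,\frac{Q^{l_i}}{(Q^{l_i}-1)^2}\biggr)
=(-1)^r\,r\,\frac{1}{(Q-1)^{r-1}}\,\frac{Q}{(Q-1)^{2}}
=(-1)^r\frac{rQ}{(Q-1)^{r+1}}.
\]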
Thus we have proved the following theorem 
Theorem 1. The average value of the $r$th left-to-right maximum in a random
sequence of $n$ elements, generated by geometric random variables, is given by
\[
\frac{r}{p}+O\Bigl(\frac1n\Bigr)\qquad\text{for fixed $r$ and $n\to\infty$.}
\]

A full asymptotic expansion would be available, at least in principle, with more involved
computations, as well as the variance.

Again, as in all the examples that will follow, the lower order terms contain periodic
fluctuations of the form $\delta(\log_Q n)$.
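The main term $r/p$ can be compared with a simulation. The following Monte Carlo sketch is ours and not from the paper; the names, the seed and the sample sizes are arbitrary choices. It averages the value of the $r$th strict maximum over those random words that possess one, so the result carries statistical noise and agrees with $r/p$ only approximately.

```python
import math
import random

def rth_maximum(word_length, r, p, rng):
    """Scan one random word; return (value, position) of its r-th strict
    maximum, or None if the word has fewer than r maxima."""
    log_q = math.log(1.0 - p)
    best, seen = 0, 0
    for pos in range(1, word_length + 1):
        letter = 1 + int(math.log(1.0 - rng.random()) / log_q)  # geometric letter
        if letter > best:
            best, seen = letter, seen + 1
            if seen == r:
                return best, pos
    return None

def average_value(word_length, r, p, trials, seed=0):
    rng = random.Random(seed)
    values = []
    for _ in range(trials):
        hit = rth_maximum(word_length, r, p, rng)
        if hit is not None:           # condition on the r-th maximum existing
            values.append(hit[0])
    return sum(values) / len(values)

for p in (0.3, 0.5):
    print(p, average_value(500, 3, p, 4000), 3 / p)
```

The heuristic behind the main term is the memoryless property of the geometric distribution: each new strict maximum exceeds the previous one by an amount that is again geometric with mean $1/p$.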



5. The average position of the rth maximum

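In order to compute this parameter (or rather the modified version), we have to
differentiate the generating function from the Introduction with respect to $v$ and plug in $v=1$. The desired
quantity is then obtained via
\[
\begin{aligned}
&[z^n][t]\Bigl(\frac pq\Bigr)^{r}\frac{1}{1-z}\sum_{1\le i_1<\dots<i_{r-1}<h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}z}{[i_j]}+t\,\frac{q^{i_j}z\,(1-q^{i_j})z}{[i_j]^2}\biggr)zq^{h}\\
&\quad=(-1)^n[w^n](1-w)^{n-1}\Bigl(\frac pq\Bigr)^{r}(-1)^r\,[t]\sum_{1\le i_1<\dots<i_{r-1}<h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}-t\,\frac{q^{i_j}w\,(1-q^{i_j})w}{(1-q^{i_j}w)^2}\biggr)wq^{h}\\
&\quad=\Bigl(\frac pq\Bigr)^{r}(-1)^{r-1}\sum_{k=r-1}^{n-1}\binom{n-1}{k}(-1)^kf(k),
\end{aligned}
\]
where
\[
f(k)=[w^{k+1}][t]\sum_{1\le i_1<\dots<i_{r-1}<h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}-t\,\frac{q^{i_j}w\,(1-q^{i_j})w}{(1-q^{i_j}w)^2}\biggr)wq^{h}
=f_1(k)+f_2(k)
\]
with
\[
f_1(k)=[w^{k}][t]\sum_{1\le i_1<\dots<i_{r-1}<h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}-t\,\frac{q^{i_j}w}{(1-q^{i_j}w)^2}\biggr)wq^{h}
\]
and
\[
f_2(k)=[w^{k+1}][t]\sum_{1\le i_1<\dots<i_{r-1}<h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}+t\,\frac{(q^{i_j}w)^2}{(1-q^{i_j}w)^2}\biggr)wq^{h}.
\]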
Now the two sums are in a form where our technical lemma applies! 
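For $f_1(k)$ note that $A_1(w)=\dots=A_{r-1}(w)=\frac{w}{1-w}-t\,\frac{w}{(1-w)^2}$ and $A_r(w)=w$. Thus
\[
f_1(k)=[t]\sum_{1=l_1<l_2<\dots<l_r=k}
\frac{(1-t(l_2-l_1))\cdots(1-t(l_r-l_{r-1}))}{(Q^{l_1}-1)\cdots(Q^{l_r}-1)}
=-(k-1)\sum_{1=l_1<l_2<\dots<l_r=k}\frac{1}{(Q^{l_1}-1)\cdots(Q^{l_r}-1)}.
\]
For $f_2(k)$ note that $A_1(w)=\dots=A_{r-1}(w)=\frac{w}{1-w}+t\,\frac{w^2}{(1-w)^2}$ and $A_r(w)=w$. Thus
\[
f_2(k)=[t]\sum_{1=l_1<l_2<\dots<l_r=k+1}
\frac{(1+t(l_2-l_1-1))\cdots(1+t(l_r-l_{r-1}-1))}{(Q^{l_1}-1)\cdots(Q^{l_r}-1)}
=(k-r+1)\sum_{1=l_1<l_2<\dots<l_r=k+1}\frac{1}{(Q^{l_1}-1)\cdots(Q^{l_r}-1)},
\]
since the $r-1$ gaps $l_i-l_{i-1}$ add up to $l_r-l_1=k$.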

As we know from before, it is the "value" (the behaviour) of $f(z)$ at $z=0$ that is required. It
is $f_1$ that is dominant here: Since in general
\[
\sum_{\alpha<l_1<\dots<l_s<\beta}a_{l_1}\cdots a_{l_s}
=(-1)^{s}\sum_{\beta\le l_1\le\dots\le l_s\le\alpha}a_{l_1}\cdots a_{l_s},
\]
we find that as $z\to0$
\[
f(z)\sim\frac{(-1)^{r}}{(Q-1)(Q^{z}-1)^{r-1}}.
\]
Thus, according to the theory in [?], where it is explained in detail what kind of
contribution an $r$th order pole at $z=0$ gives, we have proved that

Theorem 2. The average position of the $r$th left-to-right maximum in a random
sequence of $n$ elements, generated by geometric random variables, is given by
\[
\frac{1}{(r-1)!}\Bigl(\frac{p}{q}\log_Q n\Bigr)^{r-1}+O\bigl(\log^{r-2}n\bigr)
\qquad\text{for fixed $r$ and $n\to\infty$.}
\]
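For $r=2$ the average position can be computed directly, which gives a feeling for the main term $\frac pq\log_Qn$. The sketch below is our own elementary computation and not the method of the paper; the function name is hypothetical. If the first letter equals $h$, each later letter exceeds it with probability $q^h$, so the waiting time for the second maximum is geometric and is cut off at the end of the word. Since Theorem 2 has an error term $O(1)$ for $r=2$, only a bounded difference to the main term can be expected.

```python
import math

def mean_position_second_max(n, p):
    """Mean position of the 2nd strict maximum in words of length n,
    conditioned on its existence (assumes 0 < p < 1)."""
    q = 1.0 - p
    weighted = exists = 0.0
    h = 1
    # larger first letters contribute less than about 1e-15 in total
    while p * q ** (h - 1) * n * n > 1e-15:
        first = p * q ** (h - 1)      # probability that the first letter is h
        success = q ** h              # probability that a letter exceeds h
        fail_run = 1.0                # (1 - success)^(t-1)
        for t in range(1, n):         # the 2nd maximum sits at position 1 + t
            mass = first * fail_run * success
            weighted += (1 + t) * mass
            exists += mass
            fail_run *= 1.0 - success
        h += 1
    return weighted / exists

p = 0.5
q = 1.0 - p
for n in (100, 1000, 4000):
    main_term = (p / q) * math.log(n) / math.log(1.0 / q)
    print(n, mean_position_second_max(n, p), main_term)
```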

6. Weak left-to-right maxima; the value 

We mention here briefly the analogous developments for the instance of weak
left-to-right maxima.

The generating function of interest is
\[
\frac{1}{1-z}\prod_{i=1}^{h}\frac{1}{1-\dfrac{pq^{i-1}zu}{1-(1-q^{i-1})z}}\;pq^{h-1}z
\]
and the coefficient of $u^{r-1}$ therein is
\[
\frac{1}{1-z}\sum_{1\le i_1\le\dots\le i_{r-1}\le h}
\frac{pq^{i_1-1}z}{1-(1-q^{i_1-1})z}\cdots\frac{pq^{i_{r-1}-1}z}{1-(1-q^{i_{r-1}-1})z}\;pq^{h-1}z.
\]
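The technical lemmas that we need now are

Lemma 4.
\[
[w^n]\sum_{0\le i_1\le i_2\le\dots\le i_s}A^{(1)}(wq^{i_1})\cdots A^{(s)}(wq^{i_s})
=\sum_{0=l_0<l_1<\dots<l_{s-1}<l_s=n}
\frac{a^{(s)}_{l_1-l_0}\cdots a^{(1)}_{l_s-l_{s-1}}}{(1-q^{l_1})\cdots(1-q^{l_s})}
\]
and
\[
[w^n]\sum_{0\le i_1\le i_2\le\dots\le i_s}A^{(1)}(wq^{i_1})\cdots A^{(s)}(wq^{i_s})\,(i_s+1)
=\sum_{0=l_0<l_1<\dots<l_{s-1}<l_s=n}a^{(s)}_{l_1-l_0}\cdots a^{(1)}_{l_s-l_{s-1}}
\Bigl(1+\sum_{i=1}^{s}\frac{q^{l_i}}{1-q^{l_i}}\Bigr)\prod_{i=1}^{s}\frac{1}{1-q^{l_i}}.
\]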



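We find for the average value (up to the normalization, as before)
\[
p^{r}(-1)^{r-1}\sum_{k=r-1}^{n-1}\binom{n-1}{k}(-1)^kf(k)
\]
with
\[
f(k)=\sum_{1=l_1<l_2<\dots<l_r=k+1}
\Bigl(1+\sum_{i=1}^{r}\frac{q^{l_i}}{1-q^{l_i}}\Bigr)\prod_{i=1}^{r}\frac{1}{1-q^{l_i}}.
\]
Also, $f(0)=(-1)^{r}\frac{rq+p}{p^{r+1}}$ and thus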



Theorem 3. The average value of the $r$th left-to-right maximum (in the weak
sense) in a random sequence of $n$ elements, generated by geometric random variables,
is given by
\[
\frac{rq+p}{p}+O\Bigl(\frac1n\Bigr)\qquad\text{for fixed $r$ and $n\to\infty$.}
\]



7. Weak left-to-right maxima; the position 

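The probability generating function of interest (up to normalization by a factor that is
basically 1, as before) is given by
\[
\frac{1}{1-z}\sum_{h\ge1}\prod_{i=1}^{h}\frac{1}{1-\dfrac{pq^{i-1}zvu}{1-(1-q^{i-1})zv}}\;zvpq^{h-1},
\]
and the desired expected value (again of the modified parameter "position $-\,r$") is
\[
\begin{aligned}
&[z^n][t]\,p^{r}\frac{1}{1-z}\sum_{0\le i_1\le\dots\le i_{r-1}\le h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}z}{[i_j]}+t\,\frac{q^{i_j}z\,(1-q^{i_j})z}{[i_j]^2}\biggr)zq^{h}\\
&\quad=p^{r}(-1)^{r-1}\sum_{k=r-1}^{n-1}\binom{n-1}{k}(-1)^kf(k)
\end{aligned}
\]
with $f(k)=f_1(k)+f_2(k)$ and
\[
f_1(k)=[t][w^{k}]\sum_{0\le i_1\le\dots\le i_{r-1}\le h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}-t\,\frac{q^{i_j}w}{(1-q^{i_j}w)^2}\biggr)wq^{h}
\]
and
\[
f_2(k)=[t][w^{k+1}]\sum_{0\le i_1\le\dots\le i_{r-1}\le h}\prod_{j=1}^{r-1}
\biggl(\frac{q^{i_j}w}{1-q^{i_j}w}+t\,\frac{(q^{i_j}w)^2}{(1-q^{i_j}w)^2}\biggr)wq^{h}.
\]
We find
\[
f_1(k)=-(k-1)\sum_{1=l_1<l_2<\dots<l_r=k}\frac{1}{(1-q^{l_1})\cdots(1-q^{l_r})}
\]
and
\[
f_2(k)=(k-r+1)\sum_{1=l_1<l_2<\dots<l_r=k+1}\frac{1}{(1-q^{l_1})\cdots(1-q^{l_r})}.
\]
As $z\to0$,
\[
f(z)\sim\frac{(-1)^{r}}{p\,(1-q^{z})^{r-1}}
\]
and thus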

Theorem 4. The average position of the $r$th left-to-right maximum (in the weak
sense) in a random sequence of $n$ elements, generated by geometric random variables,
is given by
\[
\frac{1}{(r-1)!}\bigl(p\log_Q n\bigr)^{r-1}+O\bigl(\log^{r-2}n\bigr)
\qquad\text{for fixed $r$ and $n\to\infty$.}
\]
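Both weak-case statements can be compared with a direct computation. The sketch below is ours and does not follow the paper's method; the names are hypothetical. It scans a word from left to right in the weak model, keeps the joint distribution of the number of weak maxima and the current maximum, and records value and position at the moment the $r$th weak maximum appears. Letters above `alphabet` are ignored, which is an approximation with negligible error for the parameters shown.

```python
import math

def weak_rth_maximum_stats(n, r, p, alphabet=100):
    """Return (probability that the r-th weak maximum exists,
    its mean value, its mean position) for words of length n."""
    q = 1.0 - p
    letter = [0.0] + [p * q ** (k - 1) for k in range(1, alphabet + 1)]
    stay = [0.0] + [1.0 - q ** (k - 1) for k in range(1, alphabet + 1)]
    # state[c][m]: probability of c weak maxima so far (c < r), current maximum m
    state = [[0.0] * (alphabet + 1) for _ in range(r)]
    state[0][0] = 1.0
    total = value_sum = position_sum = 0.0
    for t in range(1, n + 1):
        new = [[0.0] * (alphabet + 1) for _ in range(r)]
        for c in range(r):
            at_most = 0.0                 # mass of states (c, m) with m <= k
            for k in range(alphabet + 1):
                at_most += state[c][k]
                new[c][k] += state[c][k] * stay[k]   # next letter is < k
                gain = letter[k] * at_most           # letter k >= old maximum
                if c + 1 == r:
                    total += gain
                    value_sum += k * gain
                    position_sum += t * gain
                else:
                    new[c + 1][k] += gain
        state = new
    return total, value_sum / total, position_sum / total

p, n = 0.5, 1000
q = 1.0 - p
for r in (2, 3):
    exists, value, position = weak_rth_maximum_stats(n, r, p)
    main_position = (p * math.log(n) / math.log(1.0 / q)) ** (r - 1) / math.factorial(r - 1)
    print(r, exists, value, (r * q + p) / p, position, main_position)
```

The position is only determined up to $O(\log^{r-2}n)$ by Theorem 4, so the last two printed numbers should be of the same order without being close in absolute terms.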